Search results for "Normalized compression distance"

showing 3 items of 3 documents

Comparison of genomic sequences clustering using Normalized Compression Distance and Evolutionary Distance

2008

Genomic sequences are usually compared using evolutionary distance, a procedure that implies the alignment of the sequences. Alignment of long sequences is a long procedure and the obtained dissimilarity results is not a metric. Recently the normalized compression distance was introduced as a method to calculate the distance between two generic digital objects, and it seems a suitable way to compare genomic strings. In this paper the clustering and the mapping, obtained using a SOM, with the traditional evolutionary distance and the compression distance are compared in order to understand if the two distances sets are similar. The first results indicate that the two distances catch differen…

Kolmogorov complexityuniversal similarity metricComputer sciencebusiness.industryDNA sequencePattern recognitionGenomic Sequence ClusteringCompression (functional analysis)Normalized compression distanceArtificial intelligenceCluster analysisbusinessDistance matrices in phylogenyclustering

researchProduct

Normalised compression distance and evolutionary distance of genomic sequences: comparison of clustering results

2009

Genomic sequences are usually compared using evolutionary distance, a procedure that implies the alignment of the sequences. Alignment of long sequences is a time consuming procedure and the obtained dissimilarity results is not a metric. Recently, the normalised compression distance was introduced as a method to calculate the distance between two generic digital objects and it seems a suitable way to compare genomic strings. In this paper, the clustering and the non-linear mapping obtained using the evolutionary distance and the compression distance are compared, in order to understand if the two distances sets are similar.

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazionibusiness.industryCompression (functional analysis)Metric (mathematics)Normalized compression distanceuniversal similarity metric USM clustering DNA sequences normalised compression distance evolutionary distance genomic sequences nonlinear mapping bioinformaticsPattern recognitionArtificial intelligenceCluster analysisbusinessDistance matrices in phylogenyMathematics

researchProduct

Learning Similarity Scores by Using a Family of Distance Functions in Multiple Feature Spaces

2017

There exist a large number of distance functions that allow one to measure similarity between feature vectors and thus can be used for ranking purposes. When multiple representations of the same object are available, distances in each representation space may be combined to produce a single similarity score. In this paper, we present a method to build such a similarity ranking out of a family of distance functions. Unlike other approaches that aim to select the best distance function for a particular context, we use several distances and combine them in a convenient way. To this end, we adopt a classical similarity learning approach and face the problem as a standard supervised machine lea…

Training setbusiness.industryFeature vectorSimilarity heuristicPattern recognition02 engineering and technologyMachine learningcomputer.software_genreSemantic similarityArtificial Intelligence020204 information systemsNormalized compression distance0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingComputer Vision and Pattern RecognitionArtificial intelligenceJaro–Winkler distancebusinesscomputerClassifier (UML)SoftwareSimilarity learningMathematicsInternational Journal of Pattern Recognition and Artificial Intelligence

researchProduct